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Abstract 

o 

While there is substantial need for dependence models in high dimensions, most exist- 
ing models strongly suffer from the curse of dimensionality and barely balance parsimony 
and flexibility. In this paper, the new class of hierarchical Kendall copulas is proposed 
^ which tackles these problems. Constructed with flexible copulas specifled for groups of vari- 

OO ables in different hierarchical levels, hierarchical Kendall copulas are able to model complex 

dependence patterns without severe restrictions. The paper explicitly discusses inference 
I— I techniques for hierarchical Kendall copulas, in particular, simulation, estimation and model 

selection. A substantive application to German stock returns finally shows that hierarchical 
Kendall copulas perform very well, out-of- as well as in-sample. 

Keywords: multivariate copula, hierarchical copula, Kendall distribution function 

1 Introduction 

> 

Dependence modeling using copulas has made significant progress in the last years. Many of the 
Q\ standard, and also of the newly proposed, models however suffer from the curse of dimensionality, 

which makes it virtually impossible to use them for very large data sets as required, e.g., in 
hnancial or spatial applications. One common approach to overcome the curse of dimensionality 
^ is grouping data, e.g., by industry sectors or nationality. Such copula models include the grouped 



S3 



Student-t copula by Daul et al. (2003) and hierarchical Archimedean copulas, which were 



initially proposed by Joe (1997)| In particular, hierarchical structures such as the latter are 



^ very appealing and received considerable attention lately (see, e.g., Hofert (2010)) 



A major issue of any copula model is to find a good balance between parsimony and flexibil- 
ity. While elliptical copulas such as the common Gaussian and Student-t require an enormous 
number of parameters for specifying the correlation matrix (the number of parameter grows 
quadratically with the dimension), Archimedean and also hierarchical Archimedean copulas are 
much more parsimonious, since the number of parameters is at most linear in the dimension. 
However, such restrictions may be severe, since hierarchical Archimedean copulas are at the 
same time limited to the class of Archimedean copulas as building blocks. 

The purpose of this paper is to introduce the new class of hierarchical Kendall copulas, 
which is a flexible but yet parsimonious copula model available for applications also in higher 
dimensions. It is built up by copulas for groups of variables in different hierarchical levels. 
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In particular — and in contrast to hierarchical Archimedean copulas — , it does not require any 
restrictions with respect to copula choice or the parameters and therefore is able to model 
complex patterns of dependence such as asymmetry and tail dependence between large numbers 
of variables. 

The name "hierarchical Kendall copula" is chosen to stress the central role of the Kendall 
distribution function in the model formulation. The Kendall distribution function is the distri- 
bution function of the multivariate probability integral transform using the copula. It is used to 



aggregate the (dependence) information of a group of variables. It was first studied in Genest 



and Rivest (1993) in the bivariate case and in more detail in 


Barbe et al. (1996) Other accounts 


on it can be found, e.g., in Imlahi et al. (1999) Chakak and Imlahi (2001) , jGenest and Rivest 


(2001) and Nelsen et al. (2003) as well as in the copula goodness-of-fit literature (see, e.g., 


Wang 



and Wells (2000)). 



It has been shown by Genest et al. (1995) that the only copula which gives a valid multivari- 



ate distribution for non-overlapping multivariate marginals is the independence copula. Marco 



and Ruiz-Rivas (1992) state conditions how a distribution function with specified multivariate 
marginals can be constructed; the easiest case being that margins are max-infinitely divisible, 
which includes distributions based on Archimedean copulas. Hierarchical Kendall copulas cir- 
cumvent such issues through aggregation facilitated by the Kendall distribution functions. This 
implies that in general the obtained multivariate distribution is not a copula. 



The model, which I call hierarchical Kendall copula, has previously been mentioned by Anjos 



and Kolev (2005) , who however do not further develop the model in terms of statistical properties 
and inference. The work presented here is completely independent of theirs and develops in detail 
how hierarchical Kendall copulas can be used for statistical inference; simulation, estimation 
and model selection are discussed explicitly. Finally, a substantial 30-dimensional application to 
German stock returns is presented, showing the good out-of- as well as in-sample performance 
of hierarchical Kendall copulas. Hierarchical Kendall copulas therefore answer the need for 
parsimonious and flexible models in high dimensions and provide means to overcome the curse 
of dimensionality in dependence modeling. 

The remainder of the paper is organized as follows. The new model is introduced and 
discussed in Section [2] Section |3] then treats statistical inference technique for hierarchical 
Kendall copulas, where an extensive simulation study is performed to validate the estimation 
methods. In Section [4] the financial application is presented in detail, while Section [5] concludes. 



2 Hierarchical Kendall copulas 

A central part of the definition of hierarchical Kendall copulas, which will be given below, is 
the notion of the Kendall distribution function which therefore is treated first. Subsequently, 
hierarchical Kendall copulas are defined and their properties are discussed, in particular in 
contrast to hierarchical Archimedean copulas. 



2.1 Kendall distribution functions 



Kendall distribution functions were first studied in two dimensions by Genest and Rivest (1993) 



and studied in more generality by Barbe et al. (1996) They are defined as follows. 
Definition 2.1 (Kendall distribution function). Let C be a d- dimensional copula. For U :- 
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{Ui, Ud) ~ C the Kendall distribution function is defined as 

K^'^){t):=P{C{U)<t), tG(0,l). 



(2.1) 



In other words, the Kendall distribution function is the univariate distribution function 
of the random variable Z := C{U), the multivariate probability integral transform of U. It 
has been shown by Genest and Rivest (1993) that bivariate Archimedean copulas are uniquely 
characterized by their Kendall distribution functions. Genest et al. (2011) recently extended 
this result to the trivariate case and strongly conjecture that this holds in general. 

For my purposes, a specific interpretation of Kendall distribution functions will be particu- 
larly useful. First, I define the univariate function 

Cui,...,ua-i{ud) ■= C{ui, ...,Ud^l,Ud) 

and denote the corresponding inverse as ^"^"1 which is the quantile function of a copula as 



studied in Imlahi et al. (1999) and Chakak and Ezzerg (2000) 



smce 



C(ni,...,Mrf_i,C„^^,.._„^_^(z)) = z 

for z € (0, 1). This function will be useful for describing the level set of a copula 

L{z) = {ue [0,1]'^ ■.C{u) = z}, zG(0,l), 

which is closely related to the Kendall distribution function, since 

PiLiz)) = P{CiU) = z) =: k^''\z), 

if the Kendall distribution function K^'^') is differentiable with density k^'^^ . 
For ease of notation I further define for r < d — 1 



(2.2) 



and for z G (0, 1) 



Cu-l_,...,Ur{'^d) '• — Cui,...,Ur, 



Ca\z) := z. 



Example 2.2 (Quantile function of an exchangeable Archimedean copula). For a d- dimensional 
exchangeable Archimedean copula (see McNeil and Neslehovd (2009)) with generato^ip, 

C(mi, ...,Ud) = (X]i<i<rf f'^'^i)) ' 
the quantile function can by determined as 

Using the above notation, the following theorem provides a way to compute Kendall distri- 
bution functions of arbitrary copulas. 



'^The function generates a d-dimensional exchangeable Archimedean copula if and only if its inverse ip~^ is d- 
monotone on [0, oo), that is, Lp~^ is differentiable up to the order d— 2 on [0, oo), it holds that (— (a;) > 
for fc = 0, 1, d — 2 and for any x G [0, oo), and (— 1)'*~^(<^~^)''^~^^ is non-increasing and convex on [0, oo) (see 



McNeil and Neslehova (2009) I. 
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Family Generator (p{t) Kendall dist. function K{t) 

Independence — log(t) — log(t)) 

Clayton t"^ - 1, > t (l + 

Gumbel (-log(^))^ 0>l t (l - 

Frank _iog(i^), g M \ {0} t+'-^log(^ 



Joe -log(l-(l-tn ^>1 t - jfc^^-^)^) 

Countermonotonic 1 — t 1 
Comonotonic - i 

Table 1: Generator and Kendall distribution functions of bivariate Archimedean copulas as well 
as the Kendall distribution functions of the bivariate Frechet-Hoeffding lower and upper bounds. 

Theorem 2.3 (Kendall distribution function of a d-dimensional copula). Let U := {Ui, Ud) ~ 
C , where C is absolutely continuous with density c. If d = 2, the Kendall distribution function 



i^(2) 

is given by \Chakak and Ezzerg 200C^ 



K^'^\t) = t+ / / c{ui,U2)du2dui = t + / - — C(ni,U2) 

Jt Jo Jt " 



dui 



Following Imlahi et al. (1999)\ the Kendall distribution function of a d- dimensional copuh 



a can 



then be obtained recursively through 

= i^(^-i)(t)+ [' [' ... [' r""""^'~'^'\{ui,...,Ud) dud...dui, 

Jt JCu,\t) JCul...,ua_2{t) Jo 

where K^'^'^ denotes the Kendall distribution function of the d-dimensional copula and 
that of the (d — 1) -dimensional margin of the first {d — 1) variables. 

For general copulas it is not possible to easily determine the Kendall distribution function 
in closed form. A convenient exception are however (exchangeable) Archimedean copulas. 

Theorem 2.4 (Kendall distribution function of an Archimedean copula). Let U := {Ui, Ud) ~ 
C, where C is an exchangeable Archimedean copula. Then the Kendall distribution function is 



given by (Barbe et al. 1996) 



In particular, if d = 2, 



K^'\t) = t + Y^[-iy^^{^-')^^{^{t)). 

2=1 



with the X-function first studied in\Genest and Rivest (1993) 



Table[T]shows generator and Kendall distribution functions of common bivariate Archimedean 



copulas (see |Joe (1997) and Nelsen (2006) ) as well as of the Frechet-Hoeffding lower and up- 
per bounds. In case of the independence copula, the Kendall distribution function for general 
dimensions also has a simple closed-form expression. 
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Figure 1: Kendall distribution functions of the independence copula (2.3) (left panel) and the 
Gumbel copula with parameter 9 = 2 (right panel) for different dimensions d. 



Example 2.5 (Kendall distribution function of the d-dimensional independence copula). The 

Kendall distribution function of the d- dimensional independence copula is conveniently given as 

i^('^)(t) = t+t|;MlM. (2.3) 



Example 2.5 shows that the Kendall distribution function of the independence copula is, 
for fixed t, increasing with the dimension d. Since Kendall distribution functions are obviously 
bounded from above by 1 — which in two dimensions is the Kendall distribution function of the 
Frechet-Hoeffding lower bound — , the Kendall distribution function of the independence copula 
converges to 1 as d — >• oo. This is illustrated in the left panel of Figure [l] 

For comparison the right panel of Figure [T] shows the Kendall distribution function of the 
Gumbel copula with parameter 9 = 2 (medium positive dependence). While here the Kendall 
distribution function also converges to 1 as d — >• oo, this convergence is much slower. The 
practical implications of this property on my model will be discussed below. 

In the following, the superscript of the Kendall distribution function, which indicates the 
dimension of the associated random vector, will be omitted for reasons of readability. 

2.2 Model formulation and properties 

The definition of the new dependence model class of hierarchical Kendall copulas is now stated. 



Although the model has previously been formulated by Anjos and Kolev (2005) , it has — to the 
best of my knowledge — not yet been treated in detail or used for statistical inference. I choose 
the name hierarchical Kendall copula to stress the central role the Kendall distribution function 
plays in the model formulation. 

Definition 2.6 (Hierarchical Kendall copula). Let Ui, Un ~ U{0, 1). Further let Cq, Ci, 
be copulas of dimension d for Cq and 1 < Ui < d for Ci, i € {l,...,d}, where n = Yli=i''^i' 
respectively. Moreover let i , . . . , -fC^ denote the Kendall distribution functions corresponding to 
Ci, Cd, respectively, and define rm = X]}=i "-j /^'^ i = 0, d. Then the hierarchical Kendall 
copula is defined as follows: 

(i) {Um,_i+i,-,Umi) -^Ci^iG {!,..., d}, 

(ii) {Vu ...,Vd) ~ Co, where Vt := Ki{Ci{Um,_,+i, ...,C/^J) Vi G {1, ...,4. 
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Ci(C7i) ~ A'l 









C2(U2)~K2 






V2 :=A'2(C2(C/2))~ (7(0,1) 







{yi,...,Ki)~Co 



V'd := Ki{Cd(Ui)) ~ (7(0, 1) 



Figure 2: Illustration of the hierarchical Kendall copula model of Definition 2.6 



Definition 2.6 is illustrated in Figure [2] Note that Cq is in general not a copula of (f/i, J7„) 
but of (Vi,...,Vd), which are uniform random variables, since Ci(C/m,_i+i) f^mj ^ Ki yi £ 
{!,... Typically Cq is denoted as nesting copula, while (C/mj-i+i, •••) C^mJ, ^ £ {l)---)^^}) is 
called cluster. The nesting copula Cq and the cluster copulas Ci, ...,Cd can be chosen indepen- 
dently. They can be arbitrary copulas such as common Archimedean or elliptical copulas or 
from any other class of copulas. From now on it is assumed that they are absolutely continuous 
and possess densities cq and ci, ...,Crf, respectively. 

The Kendall distribution functions Ki,...,Kd are used to summarize the information con- 
tained in the clusters (transformation of nj-variate to univariate random variable). While one 
may also think of other transformations, I believe that Kendall distribution functions are par- 
ticularly useful for this purpose. First, since the Kendall distribution function is monotone, Cq 



essentially models the comovement of the contour levels (2.2). This can be seen as a proxy 
for the strength of dependence in the clusters, since the dimensionality of the single clusters 
is "normalized" through the Kendall distribution functions. Second, other variables such as 
components of an elliptical distribution do not contain such specific information which summa- 
rizes the information in a single variable. And third, a focus on tails, as it is often desired, is 
empirically hardly feasible, since tail behavior is very hard to quantify appropriately. Thus, the 
transformation using the Kendall distribution function reasonably summarizes the (dependence) 
information of a multivariate random vector. For the particular purpose of risk aggregation an 
alternative hierarchical dependence model for sums of random variables was recently proposed 



by Arbenz et al. (201 1^ 



The two-level construction given in Definition 2.6 can easily be defined for an arbitrary 
number of levels. 



Definition 2.7 (Hierarchical Kendall copula with k levels). Let Ui, Un ~ f^(0, 1) and di > 

d2 > ••• > dk-i- Further let Cq be a dk-i-dimensional copula and c\^\ j = 1, ■■■,k — 1 (index 

running over the k levels), i = l,...,dj (index running over the dj clusters of each level j), 

-di „(i) _j J _ 



be copulas of dimension 1 < n 



< dj, where n 



n 



and dj^i 



nl for J 



2,...,k 

l,...,k — l,i = l,...,dj, and define rn'p = ^ 
Kendall copula with k levels is defined as follows: 



1. Further let Kf denote the Kendall distribution function corresponding to c\''\ j 



j=i iT-j for I 



0,...,d. Then the hierarchical 



(i) {U 



(1). 



ViG{l,...,di}, 
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■■.(7,„(.i 













V/" :=ifi(Ci(C/i))~!7(0,l) 



Vt' :=Ke(Ci(Ut))~UlO,l) 





(vf,.. 
















> := 


Xi(Ci(Vi)) ' 


-!7(0,1) 






V],' :=-ff*(Q,(V,J)~f/(0.1) 



Figure 3: Illustration of a three level hierarchical Kendall copula model (see Definition 2.7). 



Here m 



(2) 
1 



(in) for j = 2, A; - 1.- 



An example of a three level hierarchical Kendall copula is shown in Figure |3j For simplicity 
and illustrative reasons I restrict my exposition here to the case oik = 2 hierarchical levels. All 
derivations and methods described in the following can easily be generalized to the general k 
level case. 

The joint density function of a hierarchical Kendall copula is now derived under certain 
independence assumptions. 

Theorem 2.8 (Joint density of a hierarchical Kendall copula). Let U = (Ui, ...,Un) be dis- 
tributed according to a hierarchical Kendall copula. Further let Ui = {Urm^ i+i, ■■■,Umi) and 
Ui = {umi_i+i, ■■■,Umi), i = l,...d. Similarly u = {ui,...,Un). If 

Ui±Uj\{Vu...,Vd) and Ui ±Vj\Vi ^ j, ij £ {I, d}, (2.4) 

then the joint density function cjc of a hierarchical Kendall copula is given as follows: 



cjc{u) = co{Ki{Ci{ui)), Kd{Cd{ud)))'[{ci{ui). 
Proof: By adding only redundant information, one gets 



(2.5) 



i=l 



ciciu) = P{U = u) = P{U = u,Vi = KiiCiiu^)), Vd = KdiCdiud))) 

= P{U = u\Vj = Kj{Cj{u,)),j = 1, ...,d)P{Vj = Kj{Cj{uj)),j = 1, d), (2.6) 
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where Vj := Kj{Cj{Uj)), j = l,...,d. Following Definition 2.6 the second term of Equation 



(2.6) is then given by 

P{V, = Kj{C,{uj)),j = l,...,d) = co{Ki{Ci{ui)),...,KdiCd{u,))). (2.7) 



On the other hand, the first term of Equation (2.6) simplifies to 



P{U = u\Vj = Kj{C,{u,)),j = 1, ...,d) = llP{Ui = u,\Vj = K,{Cj{ui)),j = 1, 

i=l 
d 

= llP{Ui = u,\Vi = Ki{Ci{ui))), 



i=l 



because of condition (2.4) which in particular implies that Ui\{Vi, ...,Vd) = Ui\Vi. 
Further, 

P{Ui = Ui\Vi = K,{Ci{ui))) = P{Ui = u,) = c^{ui) Vi = 1, d, 
since the conditioning is redundant information. 



(2. 



The proof concludes by plugging Equations (2.7) and (2.8) into Equation (2.6) 



□ 



I do not regard condition (2.4), which perfectly fits into the setting of Definition 2.6 as 



rather restrictive. The intuition behind the two assumptions is that, given the information of 
the nesting variables Vi, ...,Vd, the clusters are independent of each other and also of other nesting 
variables, since the dependence among the clusters is explained through the "representatives" 
Vi, ...,V(i- In numerical experiments this was confirmed empirically. 



Theorem 2.8 now allows to state the following corollary which summarizes the marginal 
properties of hierarchical Kendall copulas. 

Corollary 2.9 (Margins of a hierarchical Kendall copula). The same notation as in Theorem 
\2.8\ is used. 

(i) Bivariate margins: Let k,i £ {1, ...,n}, k ^ L W.l.o.g. k < i. 

(a) IfUk and Ui are in the same cluster i, their marginal distribution function Cki is 

Cke{uk,ue) := Ci{l, l,Uk, 1, l,ue, 1, 1). 



(b) If Uk and Ue are in different clusters and condition (2.4) holds, their marginal dis- 
tribution function Cm is 

Ckeiuk,ui) := / / cic{ui,...,Uk-i,u!i,Uk+i,-..,ue-i,W2,ue+i,...,Un) 
Jo Jo J[0,l]"-2 



dui...duk-iduk+i--- dug- 1 dug^ i . . . dw2 dw i . 



(2.9) 



(a) Multivariate margins: The marginal distribution of the cluster Ui is Ci 
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Proof: Claims (i)(a) and (ii) directly follow from Definition 2.6 With regard to (i)(b), Theorem 
2.8| yields the bivariate marginal density Cki of Uk and Ui as 



Cke{uk,Ui)= ciciu) dui...duk^iduk+i...dui^idui+i...dun, (2-10) 

J[o,i]"-2 



from which Equation (2.9) follows through integration. □ 



More general multivariate margins involving variables from different clusters can be derived 



as in Equation (2.9) 



Furthermore bivariate marginal distributions where the variables are in different clusters can 



be regarded as some kind of continuous mixture of the nesting copula Cq. By plugging (2.5) 



into (2.10), the density of Cke as defined above is given by 



^ d 

Cke{uk,ue)= / co{Ki{Ci{ui)), ...,Kd{Cd{ud)))Y\ Ciiui) 

dui...duk-iduk+i---du£^idu£^i...dun, 
where the mixing density weights are given by Ilf=i Ci(wi). 

2.3 Hierarchical Kendall copulas vs. hierarchical Archimedean copulas 

The popular class of hierarchical Archimedean copulas also allows for a nested modeling of clus- 
ters of variables. In contrast to hierarchical Kendall copulas, hierarchical Archimedean copulas 
are however limited to Archimedean copulas as building blocks, while hierarchical Kendall cop- 
ulas can be built up by any possible copula. Furthermore, hierarchical Kendall copulas do not 
have any restrictions regarding the dependence, while hierarchical Archimedean copulas require 
stronger intra-cluster dependence, which results in parameter restrictions if generators are the 
same. 

As will be discussed below, hierarchical Kendall copulas also benefit from many of the con- 
venient properties of Archimedean copulas. It will be shown that closed-form sampling of hi- 
erarchical Kendall copulas is feasible, when cluster copulas are (exchangeable) Archimedean. 
Hierarchical Kendall copulas with Archimedean cluster copulas are further particularly easy to 
estimate, since Kendall distribution functions are known in closed form for Archimedean cop- 
ulas. For that reason they also provide a closed-form joint density function (under condition 



(2.4)), which is numerically tractable even in higher dimensions. 

In contrast to hierarchical Archimedean copulas the nesting copula Co is however not closed 
under addition and removal of cluster components Ue. This is because the Kendall distribution 



function is not independent with respect to the dimension (cp. Theorem 2.3). That is, if a 
random variable Un+i is added to cluster j G the transformation Kj and thus Vj 

change, even if Cj is exchangeable Archimedean. Similarly, if a random variable is removed 
from a cluster. 



3 Inference of hierarchical Kendall copulas 

In the following, statistical inference techniques for hierarchical Kendall copulas are discussed. 
First simulation is treated, then estimation and model selection. 
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3.1 Simulation 



The following general sampling procedure describes how to obtain one single sample from a 
given hierarchical Kendall copula. 

Algorithm 3.1 (Simulation of hierarchical Kendall copulas). Let Co, Ci, Cd he known. Fur- 
ther let K^^ denote the inverse of the Kendall distribution function Ki for i = and 
assume that it is also known. 

(i) Obtain a sample (ui, from Cq. 

(ii) Setz., ■.= K-\vi) Vi G {1,...,4- 

(Hi) Obtain a sample {um,^^+i, ■■.,Um,) from (Um.^^+i, .■.,UmJ\Ci{Um,_i+i, ■■■,UmJ = Zi for 
i = 1, d. 

(iv) Return u := (ui, 

Given that one can simulate from the copula Cq, sampling from hierarchical Kendall copulas 
thus amounts to the more general question of sampling from a distribution U\C{U) = z, where 
C is the copula of a marginally uniform random vector U := (f/i, ^7^) and z £ (0,1). In 
other words, one wants to sample from a multivariate distribution given a specific level set 
(2.2) at level z as illustrated in the left panel of Figure |4| This problem is discussed in the 



following three sections, where the first section solves the issue using (approximate) rejection 
sampling, while the second provides expressions for conditional distribution functions, which are 
shown to be available in closed form for exchangeable Archimedean copulas. The third section 
provides an alternative solution for Archimedean copulas, which is based on recent results from 
the literature. 



Before moving on, I however like to note that in contrast to the assumptions in Algorithm 3.1 



the Kendall and the copula distribution functions may not be known in general. This problem 



is treated explicitly in Section 3.2 where estimation of hierarchical Kendall copulas is discussed. 
When Kendall distribution functions are known, inverses are typically obtained by numerical 
inversion, since closed-form expressions are often not available. 

3.1.1 Rejection sampling 



To obtain an approximate sample from U\C(U) = z, rejection sampling can be used as in Imlahi 



et al. (1999) Instead of sampling from U\C(U) = z, one chooses a small number e > and 



sample from U\{z — e < C{U) < z + e). This is illustrated in the middle panel of Figure |4| 
Algorithm 3.2 (Rejection sampling). Let z £ (0, 1) and e > 0. 

(i) Obtain a sample u := {ui, ...,Ud) from C. 

(ii) If \C{u) — z\ < e, return u. Otherwise go hack to step (i). 

The right panel of Figure |4] shows an exemplary sample from a Clayton copula with parameter 
= 2 aX contour level z = 0.2. For illustrative reasons e is chosen as 0.01. 



Algorithm |3.2| can efficiently be combined with Algorithm 3.1 in order to simultaneously 
obtain N samples from a hierarchical Kendall copula. 
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Figure 4: Left panel: observations from a bivariate Clayton copula with parameter 9 = 2 and 
level sets at levels z = 0.1,..., 0.9 (solid lines). Middle panel: level set at level z = 0.2 of the 
bivariate Clayton copula distribution function with parameter 9 = 2 (solid line). The dashed 
lines illustrate the rejection sampling area [z — 0.01,2; + 0.01]. Right panel: scatter plot of 
corresponding samples generated through rejection sampling. 



Algorithm 3.3 (Simulation of hierarchical Kendall copulas using rejection sampling). Let Co, 

Ci, ...,Cd and K^^ , KJ^ be known. Further let e > be given. 

(i) Obtain N .samples {vji, ...,Vjd), j = 1, from Cq. 

(a) For i = 1, d: 

(a) Set Zji := Kr^ivji) Mj £ {1,...,N} and define D := {l,...,iV}. 

(b) Obtain a sample u := (um- ^+i, ...,Umi) from C. 

(c) If for any f G {l,...,iV}, \C{u) - Zj>i\ < e, set (uj/,m^_j+i, Uj/,^,) := it and 
D := D\ Go back to step (b) until D = %. 

(Hi) Return samples {uji, ...,Ujn), j = 1,...,N,. 

The computational efficiency obviously depends on the choice of e. The smaller e is, the 
longer Algorithm 3.3 takes, but, at the same time, the more accurate the results are. Due to 
rounding errors in simulation and calculation of the copula distribution function, it is in fact 
reasonable to allow for an error e. 



3.1.2 Conditional inverse method 

A common method to generate samples from a multivariate distribution is the conditional in- 



verse method (see Devroye (1986)). For this one needs to determine the iterative conditional 
distribution functions of U\C(U) = z, that is, of Ui\C{U) = z, U2\{Ui = ui,C{U) = z), 
Ud\{Ui = Ul, ...,Ud-i = Ud-i,C{U) = z)j^ The corresponding conditional distribution func- 
tions are denoted by Fj^i^ j_i{-\ui,...,Uj-i,z) and densities by .,_i(-|ui, z) for 
j = l,...,d, respectively. Then one obtains a sample from U\C{U) = z using the following 
algorithm. 



^If there is more than one sample j with \C{u) — Zji\ < e, choose j' such that \C{u) — Zjij\ < \C{u) — Zji\ "ij. 
^The distribution of Ud\{U-i — ui, Ud-i ~ Ud~i, C{U) = z) actually does not need to be determined because 
the value of Ud is uniquely given through the conditioning variables. 
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Algorithm 3.4 (Conditional inverse method). Let z G (0, 1). 

(i) Obtain {d — 1) uniform samples vi, ...,Vd-i- 

(a) Forj = l,...,d-l: Uj := F7^^^ ^^._^{vj\ui, ...,Uj-i, z). 

(Hi) Set Ud := C-^\^^^^^_^{z). 

(iv) Return sample {ui, ...,Ud)- 

The problem however is to determine the conditional distribution functions which are gen- 
erally not given in closed form. 

Theorem 3.5 (Conditional distributions). Let U ^ C , then \/j = 1, ...^d 

Fj\i,...,j-i{u\ui,...,uj-i,z) = -— — > «e (C„^,...,„,_i(2;),l), (3.1) 



1 /•! » 

-1 (^w ^ r^-i 



where 

9{uj) = I ••• / ^ c{ui,...,Ud-i,C;^^\,„^ua-A^))-^C;;^\...,ua-ii^) dud-i...duj+i 

(3.2) 

The proof can be found in Appendix [A} 

Evidently the conditional distribution functions given in (3.1) in general do not allow for 
explicit expressions. In particular, if C^^ is not available in closed form such as for the Gaussian 



copula, the expression in (3.1) hardly simplifies. 

In the case of exchangeable Archimedean copulas the conditional distribution functions can 
however be obtained in closed form. 

Lemma 3.6 (Conditional distributions of exchangeable Archimedean copulas). Let U ^ C, 

where C is a d- dimensional exchangeable Archimedean copula, then Vj = 1, 

= h~ y(z)-x:['^^^.y(^,) J ' (3-3) 

The proof can again be found in Appendix [A} 



Lemma 3.6 then allows to restate Algorithm |3.4| for exchangeable Archimedean copulas, for 



which the conditional distribution functions can easily be inverted in closed form. 

Algorithm 3.7 (Conditional inverse method for exchangeable Archimedean copulas). Let C be 

an exchangeable Archimedean copula with generator ip and z G (0, 1). 

(i) Obtain {d — 1) uniform samples vi, ...,Vd-i. 

(U) For j = 1, d-1: := - v'/'^'-'^X^iz) - Ei<.<, vi^^)))- 

(Hi) Set Ud ■■= f~'^{(p{z) - Y.i<i<dfi'^i))- 
(iv) Return sample (ui, ...,Ud)- 
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Figure 5: Left panel: scatter plot of a sample of a bivariate Clayton copula with parameter 6 = 2 
at z = 0.2. Middle and right panel: 3D scatter plot and pairs plot of a sample of a trivariate 
Clayton copula with parameter 6 = 2 at z = 0.2. 



Step (in) could also be performed in step (ii). It is however not necessary to use an additional 
uniform sample Vd- This is why steps (ii) and (Hi) are shown separately. 

For illustration Figure [5] shows scatter plots of samples from bivariate and trivariate Clayton 
copulas with parameter 9 = 2. Notice in particular the difference to the right panel of Figure |4j 



As a side note, one observes that Algorithm 3.7 can in particular be used to sample from a 



given exchangeable Archimedean copula. This is described in the following algorithm of which 



an equivalent version has previously been stated in Wu et al. (2007) The idea is to first simulate 



the contour level and then sample conditional on that level set, since 

P{U = u) = P{C{U) = z)P{U = u\C{U) = z). 
Algorithm 3.8 (Sampling from exchangeable Archimedean copulas). Let C be an exchangeable 



Archimedean copula with generator (f and Kendall distribution function K (see Theorem 2.4), 
whose inverse is assumed to exist. 

(i) Obtain d uniform samples vi, ...,Vd. 

(ii) Set z := K-\vd). 

(ill) Forj = 1, ...,d: uj := - vf''~'^){^{z) - Zl<^<j ■ 

(iv) Return sample {ui, ...,Ud). 

3.1.3 Projected distribution 

Clearly the condition C{U) = z on the distribution of U means that one is in fact investigating 
a {d — l)-dimensional distribution, namely the distribution of U projected onto the level set 
C{U) = z which is a (d — l)-dimensional manifold in [0, 1]^. However this distribution is not 
easily tractable in general. 

Again in the case of exchangeable Archimedean copulas some convenient results are ob- 



tainable. McNeil and Neslehova (2009) show that if [/ ~ C, where C is an exchangeable 



Archimedean copula with generator then 

{^{Ui),...,^{Ud)) = RS, 
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where S = {Si,...,Sd) is uniformly distributed on the unit simplex {x > : Yl'j=i^k 
[d — l)-dimensional manifold in [0, l]'^, and the radial part 

d 



1}, a 



R 



has distribution Fji which can be determined through the Williamson transform (see McNeil 



and Neslehova (2009)). 



Recall that C{u) = (p ^{Ylj=if{''^j)) holds for exchangeable Archimedean copulas. Fixing 
the contour level C{U) = z is thus equivalent to setting R = ip{z), so that 

[u\c{u) ^z]^ {ip~\sMz)), 'P-\SM^))) . 

This result can then be used to provide another sampling algorithm for U\C{U) = z, which 
can be shown to be equivalent to Algorithm |3.7[ when using explicit expressions for the sample 
(si, Sd) from S in terms of uniform random variables (see Hering (2011, Lemma 3.1.8)| ). 

Algorithm 3.9 (Projected distribution sampling for exchangeable Archimedean copulas). Let 

C he an exchangeable Archimedean copula with generator ip and z G (0, 1). 

(i) Obtain a sample (si, Sd) from S. 
(a) For j = l,...,d: Uj := {p~^{sjip{z)). 
(Hi) Return sample {ui, ...,Ud)- 

When also sampling the contour level to obtain observations from the Archimedean copula, 



this is again the approach proposed by Wu et al. (2007) which has been restated by Hering 



(2011) in the setting of the work by McNeil and Neslehova (2009) as I use it here. 



As stated above, it is an open problem to determine this projected distribution for general 
copulas. Analogous sampling methods to Algorithm |3.9| could be used then. 



3.2 Estimation and model selection 

Estimation of model parameters in hierarchical Kendall copulas can either be performed stepwise 
or jointly when the joint density expression is available. Both estimation methods are discussed 
here and evaluated and compared in an extensive simulation study. Finally some remarks 
regarding model selection are made. 



3.2.1 Two-step estimation 



The hierarchical construction given in Definition 2.6 directly leads to a two-step estimation 
procedure of hierarchical Kendall copulas. 

Algorithm 3.10 (Two-step estimation of hierarchical Kendall copulas). Let (wj)j=i^..,^Ar be 
a sample of a hierarchical Kendall copula as defined in Definition 2.6 (uj = {uji, ...,Ujn) for 
j = 1,...,N). Further let 6q, 01, ...,6d be the parameters of the copulas Co,Ci, ...,Cd, respectively. 
One then obtains corresponding estimates 6i, i = 0, ...,d, as follows. 



(i) For i = 1, ...,d estimate 6i based on {uj^rni_i+i, ■■■,Uj,mi)j=i,. 
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(a) Estimate Oq based on Vji := Ki{Ci{uj^rni^-^+i, ■■■,Uj^mi\0i)\0i), i = 0, ...,d, j = 1, N . 

The Kendall distribution functions Ki, i = 1, ...,d, depend on 6i through Ci as indicated by 
the notation. 

As briefly noted before, inference of hierarchical Kendall copulas poses two main challenges: 
In order to move up the hierarchy, the copula distribution functions Ci and the Kendall distri- 
bution function Ki have to be known for alH = 1, ...,d. While this is the case for (exchangeable) 
Archimedean copulas, this is not true in general. In particular, for the popular class of elliptical 
copulas neither the copula nor the Kendall distribution functions are known in closed form. 

These two issues can be tackled as follows: If the copula distribution function Ci is unknown, 
one can substitute it by the corresponding empirical version 

1 ^ 

Ci{Umi^l + l, ■■■,Umi) = ^|M-i.m.- ]+l<Mm. .i+lv..,Mj.m,-<Mm.}' 

i=i 

since possibly high-dimensional integration of the density expression is not feasible in general. 
The accuracy of the empirical copula can be improved by obtaining a large sample of the copula 
under consideration. Other non-parametric estimators for the copula distribution functions may 
be used but are potentially computationally more demanding. 
Having computed transformed data 

Zji := Cj(Mj^m.i_i+l) •••) "^jjUii l^i)) ^ ~ ^1 •■■jd, j = 1, 

one possibility to get an approximation of the Kendall distribution function Ki is simply to take 
the empirical distribution function of (^;jj)j=i,...,Ar, i.e., 

1 ^ 

^^(') = nT.M^.,<^}- (3-4) 
i=i 

Alternatively I propose to use a kernel estimator 

1 ^ 

E 



^^(^) = ]^2.«(^)' (3.5) 



where G is a kernel distribution function and h an appropriately chosen bandwidth (see, e.g., Li 



and Racine (2006) ). If few observations are available, this approach yields a smoother function. 
In particular, if Ui is large, Ki may be close to 1 (see Figure [T]) so that a smoother estimate 
may be helpful to capture the properties of the distribution function evaluated close to 0. Other 



possible approaches may be splines (see, e.g., Xue and Wang (2010)) 



If one is only interested in the nesting copula, one can simply use the non-parametric methods 
described above for the clusters and then only estimate the parameters of the nesting copula. 
Moreover this two-step estimation procedure directly generalizes to a fe-step estimation approach 



for k level hierarchical Kendall copulas as defined in Definition 2.7 



3.2.2 Joint estimation 



Given the joint density function (2.5) of a hierarchical Kendall copula (assuming that condition 
(2.4) holds), parameters can also be estimated jointly using the joint log likelihood expression 

N 

£!c{eo,ei,...,ed) = Y,^ogc!ciu,\9o,ei,...,ed). (3.6) 
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As starting values for a joint maximum likelihood estimation, stepwise estimates as described 
above can be used. 

For this to be practically feasible the copula and Kendall distribution functions of the cluster 
copulas have to been known. If this is not the case, empirical versions can be used to approximate 
the joint log likelihood expression. When the joint log likelihood expression is however available 
common model selection techniques such as AIC and BIC can be used to check the model fit. 



3.2.3 Simulation study 

In order to validate the estimation procedures introduced above, I performed a large scale 
Monte Carlo study. For this I simulated from a four-dimensional hierarchical Kendall copula 
(two bivariate clusters) and then estimated the parameters according to the following methods: 

• Sequential (two-step) estimation with known Kendall distribution functions. 



• Sequential estimation with empirical Kendall distribution functions (3.4). 



• Sequential estimation with kernel estimator (3.5) for the Kendall distribution functions 
(Epanechnikov kernel). 

• MLE with known starting values (true parameters). 

• MLE with sequentially estimated starting values. 

The cluster copulas Ci and C2 were chosen as Clayton, Gumbel or Frank; the nesting copula Co 
as Gaussian, Clayton, Gumbel or Frank. Parameters were determined according to Kendall's r 
values of 0.4 and 0.7. Sample sizes were 250, 500 and 1000 and the number of repetitions was 
100. 

MSEs of the estimated nesting copula parameter (transformed to Kendall's r values) 
according to the above five procedures are shown in Figure [6] for the case of Clayton and 
Gumbel copulas; the other cases are not shown here due to lack of space and since it turns out 
that results are essentially independent of the chosen cluster copulas. 

Similarly, the choice of parameters of the cluster copulas does not have a great infiuence 
on the results, while larger nesting parameters mean more accurate results. The results are 
however not independent of the nesting copula. For the Frank nesting copula, there is hardly any 
difference between the five estimation procedures, but overall, MLE and sequential estimation 
with known Kendall distribution functions are best. When the Kendall distribution functions 
are unknown, the standard empirical estimator is superior to the kernel estimator of the Kendall 
distribution functions in the case of a Clayton nesting copula and vice versa if the nesting copula 
is a Gaussian or a Gumbel. Finally, while MSEs are generally rather small, there also is a clear 
positive effect of an increasing sample size. 

To summarize, especially when the Kendall distribution functions are known, estimation of 
hierarchical Kendall copulas is very accurate. When they are not known, no clear recommenda- 
tion can be made whether to use a standard empirical or a kernel estimator. 



3.2.4 Model selection 



In practical applications the clusters {Umi_i+i, ■■■,Umi),i £ {^T--id}, have to be identified. In 
cases where they are not given from the data (e.g., industry sectors in financial data) 



com- 



mon clustering techniques (see, e.g., Hastie et al. (2009)) can be used after having specified a 
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Sectors Fin. Chcm. Hcaltc. Auto. Ind. Retail IT Util. Transp. 



Mean pairw. Kendall's r 


0.41 


0.33 


0.21 


0.39 


0.38 


0.26 


0.28 


0.56 


0.29 


Est. deg. of freedom 


9.01 


10.97 


24.41 


12.96 


8.85 


10.67 


9.08 


4.75 


7.37 



Table 2: Mean pairwise empirical Kendall's r and estimated degrees of freedom of a Student-t 
copula for each cluster. 



measure of closeness among variables. If a multi-level hierarchical Kendall copula is considered, 
hierarchical clustering methods may be particularly helpful. 

Recalling the discussion at the end of Section |2.1| the size of clusters has to be carefully 
chosen because Kendall distributions may become almost degenerate at for very large clusters. 
In most practical situations this is however not an issue, since already under medium positive 
dependence the convergence to the constant function at 1 is very slow (see the right panel of 
Figure [T]). 



4 Application 



Finance is a major field, where copulas are used for dependence modeling (see, e.g., Cherubini 



et al. (2004)). Often financial data exhibits some kind of clustering structure such as industry 
sectors and national stock markets. For such data, hierarchical Kendall copulas are very suitable. 
To investigate the usefulness of this newly proposed class of dependence models and to illustrate 
the presented inference techniques, the most important German stock market index DAX are 
analyzed. 

The DAX is composed of 30 major German stocks. For these I identified ten industry sec- 
tors: financials (Allianz, Commerzbank, Deutsche Bank, Deutsche Borse, Munich Re), chemicals 
(BASF, Bayer, K+S, Linde), healthcare (Fresenius, Fresenius Medical Care, Merck), automo- 
bile (BMW, Daimler, Volkswagen), industrials (MAN, Siemens, ThyssenKrupp) , retail and con- 
sumer goods (Adidas, Beiersdorf, Henkel, Metro), IT and communications (Deutsche Telekom, 
Infineon, SAP), utilities (E.ON, RWE), transportation and logistics (Deutsche Post, Lufthansa), 
and building materials (HeidelbergCement). 

For all 30 stocks more than six years of log returns (January 2005 to July 2011) are considered, 
where the time series are separated into a training set of 1158 observations and a testing set 
of 500 observations for out-of-sample validation of my models. As it is common for copula 
modeling in finance, I preliminarily fit time series models to the marginal time series and then 
work with standardized residuals which are transformed to marginally uniform data by the 



probability integral transform (inference functions from margins (IFM) method by Joe (1997)). 
In particular, marginal GARCH(l,l)-models with Student-t innovations are chosen, which have 
been validated with appropriate tests. 

Table [2] shows the mean pairwise empirical Kendall's r and the estimated degrees of freedom 
of a multivariate Student-t copula for each cluster. Evidently, within-sector dependence is 
variable, since some clusters are more homogeneous than others. Also strong tail dependence, 
as indicated by small degrees of freedom, cannot be found in all clusters. 

I then fitted different hierarchical Kendall copulas to the training data set. Results (log 
likelihood, AIC, BIG) are reported in Table [3j As building block copulas I considered Glayton, 
Gumbel and Frank (Archimedean) as well as Gaussian and Student-t (elliptical) copulas to 
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Moael 


Far. 


z-otep LogliK. 


Jomt MLrj Loglik. 


Increase 






(Clayton, Student-t) 


55 


6593.28 


6677.73 


1.2% 


-13245.46 


-12967.47 


(Gumbel, Student-t) 


55 


6980.12 


6992.29 


0.2% 


-13874.57 


-13596.58 


(Frank, Student-t) 


55 


7178.55 


7190.29 


0.2% 


-14270.59 


-13992.59 


(Gaussian, Student-t) 


82 


7587.05 






-15010.09 


-14595.63 


(Student-t, Student-t) 


91 


7854.31 






-15526.61 


-15066.66 


(Glayton, Clayton) 


10 


5452.34 


5471.09 


0.3% 


-10922.17 


-10871.63 


(Gumbel, Gumbel) 


10 


5860.74 


5862.93 


0.0% 


-11705.85 


-11655.31 


Multivariate Gaussian 


435 


8422.09 






-15974.19 


-13775.50 


Multivariate Student-t 


436 


8861.91 






-16851.82 


-14648.08 



Table 3: Estimation results. Notation of models: (cluster copulas, nesting copula). Correlation 
matrices of multivariate Gaussian and Student-t copulas are estimated using inversion of pairwise 
empirical Kendall's r's, while the degrees of freedom parameter is estimated via MLE given that 
correlation matrix. AIC and BIC values are based on the joint MLE if available (best three 
values are indicated in bold). 



account for different dependedence structure. The specified models are shown in the first column 
of Table [3j Note that no results of models with Gaussian nesting copulas are displayed. This is 
because those models always performed inferior to respective models with Student-t copula. 

For comparison I also fitted multivariate Gaussian and Student-t copulas, for which however 
no joint MLE was possible due to the large number of parameters. Also no joint MLE for 
hierarchical Kendall copulas with Gaussian and Student-t cluster copulas was performed due 
to uncertainty in the Kendall distribution function. Since two-step estimation provides good 
estimates (see column "Increase" in Table [s]), this is however not a severe limitation. 

A 30-dimensional hierarchical Archimedean copula could not be fitted for comparison, since 



its density expression is hardly tractable and numerically very involved (see Savu and Trede 



(2010) ). A stepwise approach by fitting clusters first and then the nesting Archimedean copula 



fails due to the dependence restrictions of hierarchical Archimedean copulas (see Section 2.3): 
while there is moderate dependence within some clusters (cp. Table[2]), there still is considerable 
dependence among clusters, which cannot be modeled using a hierarchical Archimedean copula. 

The hierarchical Kendall copulas therefore benefit from not having such dependence re- 
strictions. Moreover, ten-dimensional Student-t nesting copulas appear more reasonable than 
exchangeable Archimedean nesting copulas (with only one parameter). With respect to cluster 
copulas, symmetric copulas (Prank, Gaussian, Student-t) are superior to asymmetric ones (Clay- 
ton, Gumbel). The larger flexibility when using Gaussian and Student-t copulas is reflected in 
smaller information criteria, although more parameters are required. In comparison to standard 
multivariate Gaussian and Student-t copulas, hierarchical Kendall copulas perform quite well, 
in particular when taking into account the enormous number of parameters of these models. 



4.1 Value-at-Risk forecasting 

In finance, interest is however not so much in a good in-sample fit but rather in out-of-sample 
validation. A typical exercise for this is Value-at-Risk (VaR) forecasting. If the distribution of 
returns is continuous, the (1 — a)-VaR is the a-quantile of the distribution. For risk management 
this value needs to be predicted on a daily basis, which I do for the training set of 500 days 
using moving windows of length 1158. 

Forecasts are typically evaluated in terms of exceedances, that is, the event that the predicted 
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Model Level # Exceed. UC INDl IND2 CCl CC2 CC3 CC4 



Independence copula 


99% 


103 


0.00 


0.04 


0.00 


0.00 


0.00 


0.00 


0.00 




95% 


135 


0.00 


0.15 


0.03 


0.00 


0.00 


0.00 


0.00 




90% 


157 


0.00 


0.17 


0.03 


0.00 


0.00 


0.00 


0.00 


Gaussian copula 


99% 


5 


1.00 


0.75 


0.79 


0.95 


0.35 


0.93 


1.00 




95% 


26 


0.84 


0.09 


0.74 


0.23 


0.35 


0.98 


1.00 




90% 


52 


0.77 


0.46 


0.87 


0.73 


0.03 


0.29 


0.73 


Student-t copula 


99% 


4 


0.64 


0.80 


0.21 


0.87 


0.93 


0.75 


0.95 




95% 


24 


0.84 


0.12 


0.73 


0.29 


0.39 


0.89 


1.00 




90% 


53 


0.66 


0.53 


0.85 


0.74 


0.05 


0.31 


0.73 


Kendall (Gaussian) 


99% 


5 


1.00 


0.75 


0.82 


0.95 


0.35 


0.93 


1.00 




95% 


27 


0.69 


0.08 


0.72 


0.20 


0.31 


0.97 


1.00 




90% 


56 


0.38 


0.46 


0.84 


0.51 


0.03 


0.23 


0.65 


Kendall (Frank) 


99% 


2 


0.13 


0.90 


0.46 


0.31 


0.35 


0.43 


0.82 




95% 


31 


0.23 


0.04 


0.76 


0.06 


0.21 


0.55 


0.92 




90% 


57 


0.31 


0.52 


0.81 


0.48 


0.03 


0.22 


0.64 


Kendall (Student-t) 


99% 


5 


1.00 


0.75 


0.80 


0.95 


0.35 


0.93 


1.00 




95% 


28 


0.55 


0.07 


0.74 


0.16 


0.32 


0.90 


1.00 




90% 


57 


0.31 


0.52 


0.78 


0.48 


0.03 


0.22 


0.64 



Table 4: P- values of VaR backtests for hypotheses of independence and (un) conditional coverage. 
Hierarchical Kendall copulas use a Student-t nesting copula and cluster copulas as indicated in 
brackets. 



VaR is exceeded by the observed return. For 500 forecasts on average 500 x a exceedances are 
expected. Whether the number of exceedances {unconditional coverage) and their occurrences 
(should be independent; both properties: conditional coverage) are appropriate can be evaluated 
using a range of tests (backtests) that have been proposed in the literature: the proportion of 



failures test of unconditional coverage by Kupiec (1995) (UC), the Markov test of independence 
by [Christoffersen (1998) (INDl), the joint test of conditional coverage by Christoffersen (1998) 



(CCl), the mixed Kupiec test of conditional coverage by Haas (2001) (CC2), the Weibull test 



of independence by Christoffersen and Pelletier (2004) (IND2), and the duration-based GMM 



test of conditional coverage by Candelon et al. (2011) (CC3 and CC4 with orders 2 and 5, 
respectively) . 

Here, the Value-at-Risk of an equally weighted portfolio of the 30 DAX stocks is forecasted. 
Backtesting results of the following six different models can be found in Table |4j 

• independence copula (for comparison), 

• Gaussian and Student-t copulas, 

• hierarchical Kendall copulas with Gaussian, Frank and Student-t cluster copulas and with 
Student-t nesting copula. 

In summary, none of the hypotheses of independence and (un)conditional coverage can consis- 
tently be rejected for any of the VaR levels and for any of the models — except for the multivariate 
independence copula as expected. The weak lack of conditional coverage at the 90% level as 



detected by the mixed Kupiec test of Haas (2001) is not supported by the other tests. 



20 



This shows that hierarchical Kendall copulas are as good as the common Gaussian and 
Student-t copulas when it comes to out-of-sample validation. In contrast to these models, the 
hierarchical Kendall copulas are however much more parsimonious and have straightforward 
interpretation in terms of sectorial dependence. In particular, the hierarchical Kendall copula 
with Frank cluster copulas is very parsimonious and allows for closed-form calculations and very 
efficient simulations due to its Archimedean clusters. 



5 Conclusion 

In this paper I introduced the new class of hierarchical Kendall copulas. By grouping variables 
in clusters, this copula construction method overcomes the curse of dimensionality and thus 
allows for high-dimensional applications. The use of arbitrary cluster and nesting copulas and 
potentially several nesting levels allow for very flexible models. Appropriate statistical inference 
techniques have been presented in detail and illustrated in a substantial application to German 
stock returns. 

Open research questions particularly involve efficient sampling methods for non- Archimedean 
copulas, since rejection sampling is rather slow. In particular, for the popular class of elliptical 
copulas, this is an important challenge. Alternative estimation and model selection techniques 
are also subject of future research. 
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A Proofs 

Proof of Theorem 



3.5 



The idea is to derive the conditional density fj\i^...j-i{-\ui, Uj-i, z) 
and then integrate to obtain the distribution function. One begins by observing that 

f f I \ f{ui,...,Uj-l,Uj,z) 

/,|i,...,_i(n>i,...,n,_i,z) = • (A.1) 

The numerator can then be rewritten as 

..,Uj = Uj,C{U) = z) 

P(Ui = Ml, Ud-i = Ud-i,C{U) = z) dud^i...duj+i 

P(Ui = ui,...,Ud-i = Ud-i,Ud = C'^iy..,nrf„i(2;)) dud-i-.-duj+i 

d 



f{ui, ...,Uj,z) = 


P{Ui 


= ui, Uj = 








- /' . - 


/-. 

Cu , . . 


P{Ui 








- /' . - 




P{Ui 








- /' . - 




c(mi, . 


= 9{uj), 







c(Ml,...,Md-l,C„iy..,nrf_i(^))^C'«l^...,«d-l(^) dUd-l...dUj+l 
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where g is defined in (3.2) 



Further the denominator of (A.l) then reads a^ 

f{ui,...,Uj-i,z) = _^ 9{uj) duj. 

By integration one then obtains the expression for the conditional distribution function given 



in (3.1) 



□ 



Proof of Lemma 3.6: First observe that the density of an exchangeable Archimedean copula 
C with copula distribution function 

C{ui,...,Ud) = ^-\ip{ui) + ... + ip{ud)) (A.2) 

only depends on ui, ...,Ud through the first derivatives of ip and through C{ui, ...,Ud). 

d d 

c{ui,...,Ud) = {ip-^)^'^\'f{ui) + ... + <f{ud))Wv'{u,) =: h{C{ui,...,Ud))\{^'{ui). (A.3) 



i=l 



i=l 



To see this observe that any derivative of an inverse is a function of derivatives of the original 
function applied to the inverse, i.e., {f~^)^'^\x) = f{f~^{x)), d £N, for an appropriately chosen 



/. The rest follows from the definition of exchangeable Archimedean copulas (A.2) 



Further for exchangeable Archimedean copulas the inverse copula distribution function C ^ 
can easily be determined in closed form as (see Example |2.2[) 



Ui 



and its derivative as 



d_ 
dz 



a 



(A.4) 



(A.5) 



Following Theorem 3.5 one then determines the function g given in (3.2) by plugging in 



Expressions (|A.3|)-(|A.5|): 

9{uj) 



d 



c{ui,...,Ud^i,C^^\„^^^_^{z))—C^^\^^^^^^^{z) dud^i...duj+ 



d-l 



j 



H^) Jl v'{ui)ip'{z) dud-i...du 



j+i 

Ui] I _^ ip'{uj+i)... / _^ ip'{ud-i) dud-i...duj+i, (A.6) 

i=l Cui (^) *-^i'l.---."d-2 



smce 



and 



d 



h{C{ui,...,Ud^i,C-l,^,^_^{z))) = h{z). 



''in the case j = 1 this is the density expression of the KendaU distribution function ( |2.1[ ). 
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Next, one iteratively solves the nested integrals in (A.6). First 



"d-2 = l 

"d-2=C'u]^,...,iijj_3 (z) 



Cui,...,u^_2i^) 

The second integral is then given by 

= i (c/,(n,_3) + (Ei<,<,_3 - ^(")))' ■ 

Similarly, the third integral computes to 

■J^nj^,...,u^_^(.Z) 

= ... = II (c^(n,_4) + (Ei<.<._4^(^^) - ^^'^)y 

By continuing iteratively, one finally arrives at 
and thus 

Cui,...,Uj_l (^) 

— — — — |J_ / _^ if'iUj) [^^(Uj) + (2^l<^< • ~ '^(^^ ' ' 

V J /■ j — ]^ -'C'uj.....u (z) — 



.^^ {d- j - 1)! V^i<i<i+i 



1 VrV, . 1 / . , ^ . . . 



1 ■ ■ / . ^ \ d—j 



By plugging u = 1 into (A.7), one further obtains 



j _^ g{uj)duj = h{z)ip'{z) H ^'{ui) (Ei<i< • '^(^^) ~ '^(^)) • (^•^) 

Cu, 1(2) V ''''' i — 1 ~ 



Combining Equations (A.7) and ( |A.8 ) as in (3.1) then gives 



Fj\i^,„j_i{u\ui,...,uj-i,z) = -y—-;- = ; ra^j 



d-j 



^C^L.u^_,i^) (Ei<.<, - ^(^))' 
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which is the desired result. 



□ 
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